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DETAILED ACTION 

1. Claims 1-21 have been examined. 

2. It is hereby acknowledged that the following papers have been received 
and placed of record in the file: Application on 01/12/2004 and Preliminary 
Amendment on 06/08/2004. 

Priority 

3. Receipt of papers submitted under 35 U.S.C. 1 19(a)-(d) is acknowledged; 
the papers have been placed on record in the file. The certified copy of 10-2003- 
0005236, filed on 01/27/2003 has been received and placed on record. 

Specification 

4. The title of the invention is not descriptive. A new title is required that is 
clearly indicative of the invention to which the claims are directed. A title that 
indicates thread selection for fetching instructions in a simultaneous 
multithreading (SMT) processor is suggested. 

Claim Rejections - 35 USC § 102 

5. The following is a quotation of the appropriate paragraphs of 35 
U.S.C. 102 that form the basis for the rejections under this section made in this 
Office action: 

A person shall be entitled to a patent unless - 

(b) the Invention was patented or described in a printed publication in this or a foreign country or in 
public use or on sale in this country, more than one year prior to the date of application for patent in 
the United States. 
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6. Claims 1-21 are rejected under 35 U.S.C. 102(b) as being anticipated by 
Emer et al., U.S. Patent No. 6,073,159 (Herein referred to as Emer) 

7. Referring to claim 1 , Emer has taught a multi-threaded processor, 
comprising: 

a processing pipeline [CPU; Fig. 1, component 12] including a number of 
stages [column 3, lines 48-51], each stage processing at least one instruction, 
each instruction belonging to one of a plurality of threads [column 5, lines 31-46]: 
and 

a fetch unit [Fig. 2, component 20] forming one of the stages of the 
pipeline [fetch stage; Fig. 3, component 42] and determining from which thread to 
fetch an instruction for processing by the processing pipeline [column 3, lines 62- 
67/ the fetch unit receiving information from at least one other stage of the 
processing pipeline and determining a processing time of the processing pipeline 
occupied by each thread based on the received information [The fetch unit 
receives thread attribute information and determines an estimated processing 
time of each thread (column 6, lines 22- 45)], the fetch unit determining from 
which thread to fetch an instruction for processing by the processing pipeline 
based on the determined processing time for each thread [Instructions are 
fetched from the thread determined to have the most beneficial estimated 
processing time (column 6, lines 22-45)]. 

8. Referring to claim 2, has taught the processor of claim 1 , wherein the fetch 
unit determines the thread having the smallest determined processing time as 
the thread from which to fetch an instruction for processing [The thread 
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determined to have the fewest data caches misses and, therefore,- the smallest 
determined processing time is selected as the thread from which to fetch (column 
7, line 65 - column 8, line 8)]. 

9. Referring to claim 3, has taught the processor of claim 1 , wherein the 
received information includes the operation type of instructions in the processing 
pipeline [The operation type is used to determine if an instruction in the pipeline 
is a branch (column 9, lines 1-15)]. 

10. Referring to claim 4, has taught the processor of claim 3, wherein the 
received information further includes the operation type of instructions leaving the 
processing pipeline [The operation type is used to determine if an instruction 
leaving the queue stage of the pipeline is a branch (column 9, lines 1-15)]. 

1 1 . Referring to claim 5, has taught the processor of claim 4, wherein the fetch 
unit includes a counter associated with each thread [column 8, lines 35-39], each 
counter being incremented by a processing time associated with each instruction 
of the associated thread in the processing pipeline and being decremented by a 
processing time associated with each instruction of the associated thread leaving 
the processing pipeline [For each instruction of a thread in the pipeline a bit in the 
thread's bit vector is set, thereby incrementing the tally counter associated with 
the thread by one (a processing time unit). Similarly, for each instruction of a 
thread leaving the pipeline a bit in the thread's bit vector is cleared, thereby 
decrementing the tally counter associated with the thread by one (column 9, line 
62 - column 10, line 9)] 
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12. Referring to claim 6, has taught the processor of claim 5, wherein the fetch 
unit determines the thread associated with the counter having a smallest count 
value as the thread from which to fetch an instruction for processing [column 10, 
lines 7-9]. 

1 3. Referring to claim 7, has taught the processor of claim 1 , wherein the fetch 
unit generates a weighted instruction count for each thread as the determined 
processing time of each thread [column 13, lines 54-67], the weighted instruction 
count for a thread is a count of the instructions for the thread in the processing 
pipeline with each instruction weighted by the cycle counts [constants CJ 
associated with processing the instruction [column 13, line 66 - column 14, line 
26], 

14. Referring to claim 8, has taught the processor of claim 7, wherein the fetch 
unit includes a counter associated with each thread [column 8, lines 35-39], each 
counter being incremented by the cycle counts associated with each instruction 
of the associated thread in the processing pipeline and being decremented by 
the cycle counts associated with each instruction of the associated thread leaving 
the processing pipeline [For each instruction of a thread in the pipeline a bit in the 
thread's bit vector is set, thereby incrementing the tally counter associated with 
the thread by cycle counts (after weighting). Similarly, for each instruction of a 
thread leaving the pipeline a bit in the thread's bit vector is cleared, thereby 
decrementing the tally counter associated with the thread by cycle counts 
(column 9, line 62 - column 10, line 9)]. 
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15. Referring to claim 9, has taught the processor of claim 8, wherein the fetch 
unit determines the thread associated with the counter having a smallest count 
value as the thread from which to fetch an instruction for processing [column 10, 
lines 7-9], 

16. Referring to claim 1 0, has taught the processor of claim 1 , wherein the 
processing pipeline comprises: 

an instruction decoder [Fig. 2, component 26] decoding instructions, which 
the fetch unit determines to fetch [column 4, lines 30-31], to generate at least an 
operation type of the instruction as decoder information [The op code (i.e. 
operation type) is generated from the instruction code; column 4, lines 31-35]] 
and 

a queue [instruction queue; Fig. 2, component 30] storing the decoded 
instructions and issuing decoded instructions to an execution unit for execution 
[column 4, lines 54-64]. 

17. Referring to claim 11, has taught the processor of claim 10, wherein the 
received information is the decoder information and the issued decoded 
instructions [column 9, lines 1-15]. 

18. Referring to claim 1 2, has taught the processor of claim 1 , wherein the 
processing pipeline further comprises: 

an instruction cache [Fig. 2, component 24] storing instructions, and 
outputting an instruction to the instruction decoder based on which instruction the 
fetch unit determines to fetch [column 3, lines 58-61]\ and 
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an address renamer [register renamer; Fig. 2, component 28] mapping a 
logical address generated by the instruction decoder for an instruction into a real 
address of a memory device in an execution unit [The register renamer maps a 
logical register number (i.e. logical address) into a physical register number (i.e. 
real address); column 4, lines 43-53]. 

19. Referring to claim 13, given the similarities between claim 1 and claim 13 
the arguments as stated for the rejection of claim 1 also apply to claim 13. 

20. Referring to claim 14, given the similarities between claim 2 and claim 14 
the arguments as stated for the rejection of claim 2 also apply to claim 14. 

21. Referring to claim 15, given the similarities between claim 3 and claim 15 
the arguments as stated for the rejection of claim 3 also apply to claim 15. 

22. Referring to claim 16, given the similarities between claim 4 and claim 16 
the arguments as stated for the rejection of claim 4 also apply to claim 16. 

23. Referring to claim 17, given the similarities between claim 5 and claim 17 
the arguments as stated for the rejection of claim 5 also apply to claim 17. 

24. Referring to claim 18, given the similarities between claim 6 and claim 18 
the arguments as stated for the rejection of claim 6 also apply to claim 18. 

25. Referring to claim 19, given the similarities between claim 7 and claim 19 
the arguments as stated for the rejection of claim 7 also apply to claim 19. 

26. Referring to claim 20, given the similarities between claim 8 and claim 20 
the arguments as stated for the rejection of claim 8 also apply to claim 20. 

27. Referring to claim 21 , given the similarities between claim 9 and claim 21 
the arguments as stated for the rejection of claim 9 also apply to claim 21 . 
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28. Claims 1 and 13 are rejected under 35 U.S.C. 102(b) as being anticipated 
by Borkenhagen et al., U.S. Patent No. 6,076,157 (Herein' referred to as 
Borkenhagen ). 

29. Referring to claim 1 , Borkenhagen has taught a multi-threaded processor, 
comprising: 

a processing pipeline [processor core; Fig. 1, component 100] including a 
number of stages [The processor core is pipelined (column 7, lines 15-17), 
which, by definition, inherently necessitates a number of stages], each stage 
processing at least one instruction, each instruction belonging to one of a 
plurality of threads [column 7 t lines 61-67]; and 

a fetch unit [instruction unit; Fig. 2, component 220] forming one of the 
stages of the pipeline and determining from which thread to fetch an instruction 
for processing by the processing pipeline [column 7, lines 61-67], the fetch unit 
receiving information from at least one other stage of the processing pipeline and 
determining a processing time of the processing pipeline occupied by each 
thread based on the received information [The thread switch time-out register, 
which is part of the instruction unit (column 7, lines 29-33), determines the 
allowed amount of processing time of the processing pipeline occupied by each 
thread (column 14, lines 51-55) based upon a value received (i.e. information 
received) and loaded into the thread switch time-out register; See column 15, 
lines 1-22], the fetch unit determining from which thread to fetch an instruction for 



Application/Control Number: 10/754,550 Page 9 

Art Unit: 2181 

processing by the processing pipeline based on the determined processing time 
for each thread [When the determined processing time is zero (i.e. the thread 
switch time-out register is zero), a thread switch occurs and the instruction unit 
determines to fetch from another thread; See column 15, lines 1-22]. 
30. Referring to claim 1 3, Borkenhagen has taught a method of fetching 
instructions for processing in a multi-threaded processor, comprising: 

receiving, at a fetch unit of a processing pipeline [processor core; Fig. 1, 
component 100], information from at least one other stage of the processing 
pipeline [A value to loaded into the thread switch time-out register; See column 
15, lines 1-22], the processing pipeline including a number of stages [The 
processor core is pipelined (column 7, lines 15-17), which, by definition, . 
inherently necessitates a number of stages], each stage processing at least one 
instruction, each instruction belonging to one of a plurality of threads [column 7, 
lines 67-67/; 

first determining a processing time of the processing pipeline occupied by 
each thread based on the received information [The thread switch time-out 
register, which is part of the instruction unit (column 7, lines 29-33), determines 
the allowed amount of processing time of the processing pipeline occupied by 
each thread (column 14, lines 51-55) based upon value loaded into the thread 
switch time-out register; See column 15, lines 1-22]\ and 

second determining from which thread to fetch an instruction for 
processing by the processing pipeline based on the determined processing time 
for each thread [When the determined processing time is zero (i.e. the thread 
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switch time-out register is zero), a thread switch occurs and the instruction unit 
determines to fetch from another thread; See column 15, lines 1-22]. 

Conclusion 

31. The following is text cited from 37 CFR 1.111(c): In amending in reply to a 
rejection of claims in an application or patent under reexamination, the applicant 
or patent owner must clearly point out the patentable novelty which he or she 
thinks the claims present in view of the state of the art disclosed by the 
references cited or the objections made. The applicant or patent owner must 
also show how the amendments avoid such references or objections. 

32. The prior art made of record and not relied upon is considered pertinent to 
applicant's disclosure. 

Cota-Robles, U.S. Patent No. 6,658,447, has taught a fetch method for a 
simultaneous multithreading processor that combines dynamic thread priorities 
with thread attribute counts. 

Tullsen et al., "Exploiting Choice: Instruction Fetch and Issue on an 
Implementable Simultaneous Multithreading Processor", has taught fetch 
selection techniques using dynamic thread attributes. 

El-Moursy et al., "Front-End Policies for Improved Issue Efficiency in SMT 

Processors", has taught fetch selection techniques using dynamic thread 

* 

attributes. 
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Luo et al., "Balancing Throughput and Fairness in SMT Processors", has 
taught fetch selection techniques that account for fairness. 

Any inquiry concerning this communication or earlier communications from 
the examiner should be directed to Benjamin P. Geib whose telephone number is 
(571) 272-8628. The examiner can normally be reached on Mon-Fri 8:30am- 
5:00pm. 

If attempts to reach the examiner by telephone are unsuccessful, the 
examiner's supervisor, Fritz Fleming can be reached on (571) 272-4145. The fax 
phone number for the organization where this application or proceeding is 
assigned is 703-872-9306. 

Information regarding the status of an application may be obtained from 
the Patent Application Information Retrieval (PAIR) system. Status information 
for published applications may be obtained from either Private PAIR or Public 
PAIR. Status information for unpublished applications is available through 
Private PAIR only. For more information about the PAIR system, see http://pair- 
direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll- 
free). 



Benjamin P Geib 
Examiner 




